DIGITAL WATERMARKING METHOD AND APPARATUS



BACKGROUND OF THE INVENTION 

1. Technical Field

The present invention relates to digital watermarking of data, including 
audio, video, and multimedia data. Specifically, the invention relates to 
embedding a watermark signal into digital audio data. 

2. Description of the Related Art 

The proliferation of digitized media such as image, video and multimedia is
creating a need for a security system which facilitates the identification of
the source of the material. In particular, the Internet is increasingly used for
transmitting recorded music in a digitized format. Content providers, i.e.,
owners of such recorded music in digital form, have a need to embed into
multimedia data a predetermined mark which can subsequently be detected by
software and/or hardware devices for purposes of authenticating copyright
ownership and of controlling and managing the multimedia data. Digital
watermarking has been developed as a technique for embedding identifiable
data into multimedia data.

Conventionally, watermark signals used for watermarking audio signals have
been relatively simple signals, such as a sequence of code symbols, because,
unlike image or video, inserting a large watermark signal would affect the
perceptibility of the original audio. Therefore, a watermarking technique
employing a large image as a watermark signal has been proposed. However,
prior art watermarking techniques involving an image watermark are susceptible
to unauthorized removal of watermarks, thereby making it hard to trace the
origin of copyright-protected material.

SUMMARY OF THE INVENTION 



An objective of the present invention is to provide a digital watermarking
technique that does not allow easy removal by an unauthorized person of a
watermark signal embedded in digital data, particularly audio signal data, and
yet minimizes distortion of the original data. The objective is achieved in part
by correlating the coefficients of the wavelet transformation of the magnitudes
of the Fourier transformed audio signal with the coefficients of the discrete
cosine transformed watermark signal. The coefficients of the transformed audio
signal data and the scaled-down coefficients of the watermark signal are added,
inverse wavelet transformed and inverse Fourier transformed to produce
watermarked audio signal data.

In accordance with one aspect of the present invention, a method for
inserting a watermark signal into audio signal data comprises the steps of:
Fourier transforming audio signal data into the frequency domain in the form of
first components and second components; wavelet transforming absolute values of
the first components to generate first spectral coefficients; discrete cosine
transforming a watermark signal to generate second spectral coefficients;
combining the first spectral coefficients and the second spectral coefficients;
and inverse wavelet transforming the combined coefficients.

The first components and second components may be the magnitudes and
phases of the coefficients, respectively. Preferably, the step of combining
includes a step of performing a weighted addition of the first and second
spectral coefficients. It is preferable for the method to further comprise a
step of inverse Fourier transforming the output of the inverse wavelet
transforming by using the phases of the coefficients. Also, it is preferable for
the method to further comprise a step of multiplying the second spectral
coefficients by information from the first spectral coefficients prior to the
combining step. Further, the method may comprise a step of multiplying the
second spectral coefficients by a scaling factor prior to said combining step.
The scaling factor may be in the range of 0.01 to 0.05. Preferably, the
information is a function of the sign of the first spectral coefficients.

In accordance with another aspect of the present invention, a method for
extracting a watermark from watermark-embedded audio data comprises the steps
of: Fourier transforming watermark-embedded audio data and original audio data
to generate the first components and the second components, respectively;
wavelet transforming the absolute magnitudes of the first components of the
watermark-embedded audio data and the original audio data, respectively; taking
the differences between the wavelet-transform coefficients of the
watermark-embedded audio data and the original audio data; and inverse discrete
cosine transforming the differences.

Preferably, the method further comprises a step of multiplying the
wavelet-transform coefficients associated with the watermark-embedded audio
data by the sign of the wavelet-transform coefficients associated with the
original audio data. Further, the multiplying step may comprise a step of
multiplying the wavelet coefficients associated with the watermark-embedded
audio data by a scaling factor. The sign may be obtained by using a signum
function. The scaling factor may be in the range of 20 to 100.

BRIEF DESCRIPTION OF THE DRAWINGS 

The aforementioned aspects and other features of the invention will be 
explained in the following description, taken in conjunction with the 
accompanying drawings wherein: 

Fig. 1 is a block diagram for inserting a watermark signal into audio signal 
data according to the present invention; and 

Fig. 2 is a block diagram for extracting a watermark signal from the
watermark-embedded audio signal data.

DETAILED DESCRIPTION OF THE PREFERRED INVENTION 

Referring to Fig. 1, a digital watermarking method and system according to
the present invention will be described.

When a watermark signal is transformed using a transformation scheme, 
the shape of the original watermark is not preserved. The present invention is 
based on the idea that a watermark of an impulse type is hard to delete because 
the watermark, after inventive transformations, would be distributed over the 
whole transform plane. Thus it helps to prevent unauthorized copying of
legitimate data.

Among many transformation schemes, the present invention employs the DCT
to transform a watermark, because the coefficients of a DCT-transformed plane
are real values, whereas the coefficients of a Fourier-transformed plane have
complex components, making them more difficult to match with original image
data.
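
The contrast between the two transforms can be seen on a short sequence. The
following is a minimal sketch in Python with NumPy, neither of which appears in
the original disclosure; the helper name dct_matrix and the eight-sample test
sequence are hypothetical and chosen only for illustration. It builds an
orthonormal DCT-II matrix and compares its real-valued output with the
complex-valued output of a discrete Fourier transform of the same data.

```python
import numpy as np

def dct_matrix(n):
    """Orthonormal DCT-II matrix; its transpose is the inverse DCT."""
    k = np.arange(n)[:, None]   # coefficient index (rows)
    i = np.arange(n)[None, :]   # sample index (columns)
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] /= np.sqrt(2.0)     # scale the first row so the matrix is orthonormal
    return m

# Hypothetical eight-sample watermark sequence, for illustration only.
w = np.array([0.0, 1.0, 1.0, 0.0, 1.0, 0.0, 0.0, 1.0])

dct_coeffs = dct_matrix(len(w)) @ w   # real-valued coefficients
fft_coeffs = np.fft.fft(w)            # complex-valued coefficients

print(dct_coeffs.dtype, fft_coeffs.dtype)
print(np.allclose(dct_matrix(len(w)).T @ dct_coeffs, w))  # inverse DCT restores w
```

Because the DCT coefficients are plain real numbers, they can be scaled and
added directly to other real-valued coefficients, which is the property the
paragraph above relies on.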

When inserting a watermark (W) into original audio data (S) to form
watermark-embedded audio data (S'), the quality of the watermark-embedded
audio data (S') can be controlled by adjusting the interval between the original
audio data (S) and the watermark (W) using a scaling parameter α, as shown in
Eq. 1.



[Equation 1]

Eq. 1a is always invertible. Eqs. 1b and 1c are invertible when W_i ≠ 0.
If Eqs. 1b and 1c are employed, the security of watermarks may not be
maintained for various processes in multimedia applications. Thus, the present
invention utilizes Eq. 1a.

Figs. 1 and 2 show processes of watermarking original digital data and 
extracting the watermarks, in accordance with the present invention. Referring 
to Fig. 1, a process of watermarking original digital data will be described.

When original audio data in which a watermark is to be embedded is input to
processing means (not shown in the figure), the processing means
Fourier-transforms the original audio data by using a predetermined algorithm
to generate amplitude and phase components. A Fourier series is used for the
Fourier transform, as follows:

[Equation 2]






The process of Fourier-transforming a continuous function f(x) using the 
infinite series of Eq. 2 may be defined as Eq. 3. 

[Equation 3] 

\[ X(f) = \int_{-\infty}^{\infty} x(t)\, e^{-j 2\pi f t}\, dt \]

In the Fourier transform as defined in Eq. 3, it is preferable to use complex
values, since a complex value may represent both the amplitude and the phase at
the same time, as shown below.






[Equation 4] 



\[ F(u) = R(u) + jI(u) \]
\[ F(u) = |F(u)|\, e^{j\phi(u)} \]

In Eq. 4, the Fourier spectrum is expressed as:

\[ |F(u)| = \left[ R^2(u) + I^2(u) \right]^{1/2} \]

the phase is expressed as:

\[ \phi(u) = \tan^{-1}\left[ \frac{I(u)}{R(u)} \right] \]

the power spectrum is expressed as:

\[ P(u) = |F(u)|^2 = R^2(u) + I^2(u) \]

where u represents a variable for frequency.
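
The relations of Eq. 4 can be checked numerically. The sketch below is an
illustration only, written in Python with NumPy (an assumption; the disclosure
names no language or library), and the two-tone test signal is hypothetical. It
computes the Fourier spectrum, phase and power spectrum from the real and
imaginary parts R(u) and I(u), then rebuilds the complex coefficients from
magnitude and phase. np.arctan2 is used as the four-quadrant form of the
inverse tangent of I(u)/R(u).

```python
import numpy as np

# Hypothetical test signal: 5 and 12 cycles over 64 samples.
t = np.arange(64) / 64.0
x = np.sin(2 * np.pi * 5 * t) + 0.5 * np.cos(2 * np.pi * 12 * t)

spectrum = np.fft.fft(x)
R, I = spectrum.real, spectrum.imag

magnitude = np.sqrt(R**2 + I**2)   # Fourier spectrum |F(u)|
phase = np.arctan2(I, R)           # phase phi(u), four-quadrant arctangent
power = magnitude**2               # power spectrum P(u)

# Magnitude and phase together carry the same information as F(u).
rebuilt = magnitude * np.exp(1j * phase)
print(np.allclose(rebuilt, spectrum))
```

Separating the coefficients this way is what allows the magnitudes to be
modified while the phases are set aside and reused unchanged.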

By employing Euler's equation, i.e., exp[-j2πux] = cos 2πux - j sin 2πux, the
Fourier transform can be represented by the equation defined in Eq. 5.

[Equation 5]



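\[ \mathcal{F}\{f(x,y)\} = F(u,v) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} f(x,y)\, \exp[-j2\pi(ux+vy)]\, dx\, dy \]
\[ \mathcal{F}^{-1}\{F(u,v)\} = f(x,y) = \int_{-\infty}^{\infty}\int_{-\infty}^{\infty} F(u,v)\, \exp[j2\pi(ux+vy)]\, du\, dv \]

Therefore, the Fourier spectrum, phase, and power spectrum can be given
as follows:

Fourier spectrum:

\[ |F(u,v)| = \left[ R^2(u,v) + I^2(u,v) \right]^{1/2} \]

Phase:

\[ \phi(u,v) = \tan^{-1}\left[ \frac{I(u,v)}{R(u,v)} \right] \]

Power spectrum:

\[ P(u,v) = |F(u,v)|^2 = R^2(u,v) + I^2(u,v) \]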

As shown above, the Fourier transform is defined through infinite series and
integrals over continuous (analog) signals. However, in order to implement the
Fourier transform on a computer, a modified Fourier transform for sampled data,
i.e., the Discrete Fourier Transform (DFT), is used in place of the Fourier
transform. If the DFT is employed, f(x) can be given as Eq. 6.



[Equation 6]

Also, the inverse DFT X(e N ) is defined as Eq. 7, when the rotational initial
and maximum value is X[n].



[Equation 7] 
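
Independently of the exact form of Eqs. 6 and 7, the textbook DFT pair can be
sketched as follows. This is an illustration in Python with NumPy, not the
notation of the disclosure; the function names dft and idft, the 1/N
normalization on the inverse, and the sample values are assumptions made for
the example. The forward sum is compared against NumPy's FFT routine.

```python
import numpy as np

def dft(x):
    """Forward DFT: X[k] = sum_n x[n] * exp(-j*2*pi*k*n/N)."""
    x = np.asarray(x, dtype=complex)
    n = len(x)
    idx = np.arange(n)
    kernel = np.exp(-2j * np.pi * np.outer(idx, idx) / n)
    return kernel @ x

def idft(spectrum):
    """Inverse DFT: x[n] = (1/N) * sum_k X[k] * exp(+j*2*pi*k*n/N)."""
    spectrum = np.asarray(spectrum, dtype=complex)
    n = len(spectrum)
    idx = np.arange(n)
    kernel = np.exp(2j * np.pi * np.outer(idx, idx) / n)
    return (kernel @ spectrum) / n

# Hypothetical sample values, for illustration only.
samples = np.array([1.0, 2.0, 0.0, -1.0, 0.5, 0.0, -2.0, 1.5])
spectrum = dft(samples)
restored = idft(spectrum)

print(np.allclose(spectrum, np.fft.fft(samples)))  # matches NumPy's FFT
print(np.allclose(restored.real, samples))         # round trip restores the samples
```

In practice a fast Fourier transform routine would be used instead of the
explicit matrix product, which is shown here only to make the sums visible.
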

Digital audio data is Fourier transformed at a Fourier transformer 10 as
described above, while a watermark signal is discrete cosine transformed at a
discrete cosine transformer 14. Next, the magnitudes of the coefficients of the
Fourier transformed audio data, obtained by a magnitude extractor 11, are
wavelet transformed at a wavelet transformer 13. The signs (+, -, 0) of the
audio's wavelet coefficients are then respectively multiplied with the spectral
coefficients of the watermark signal at the first multiplier 31 in order to
correlate the audio signal and the watermark signal to a certain extent. The
sign can be easily obtained by using the signum function unit 15, which outputs
1, -1 or 0 depending on the sign/polarity of an input value, disregarding the
magnitude. The spectral coefficients of the watermark signal are further
multiplied by a scaling factor α at the second multiplier 32 so as not to
change the audio signal's quality as perceived by the listener. The scaling
factor is preferably in the range of 0.01 to 0.05. In other words, the
influence of the scaled watermark signal's coefficients on the spectral shape
of the audio data is minimized so that the watermark-embedded audio signal is
perceptually no different from the original audio signal from the perspective
of the listener. The scaled coefficients are then added to the coefficients of
the wavelet transformed audio signal data at an adder 30. The added
coefficients are inverse wavelet transformed at an inverse wavelet transformer
16 to generate adjusted coefficient magnitudes. Finally, the adjusted
magnitudes, generated by the inverse wavelet transformer, and the phase
component of the audio signal data, obtained by a phase extractor 12, are input
to an inverse Fourier transformer 17 to generate the watermark-embedded audio
data.
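
The signal path of Fig. 1 can be sketched in a few lines. The following Python
(NumPy) code is a minimal illustration under stated assumptions, not the
implementation of the disclosure: the wavelet is taken to be a single-level
orthonormal Haar transform (the text does not name a wavelet), the watermark is
a one-dimensional sequence rather than an image, a one-sided real FFT is used,
and the block length is chosen so that the number of frequency bins equals the
watermark length. The function names, the click-like test signal and the value
of alpha are hypothetical.

```python
import numpy as np

def haar_forward(x):
    """Single-level orthonormal Haar transform (assumed wavelet)."""
    x = np.asarray(x, dtype=float)
    s = np.sqrt(2.0)
    return np.concatenate([(x[0::2] + x[1::2]) / s, (x[0::2] - x[1::2]) / s])

def haar_inverse(c):
    """Inverse of haar_forward."""
    half = len(c) // 2
    s = np.sqrt(2.0)
    x = np.empty(2 * half)
    x[0::2] = (c[:half] + c[half:]) / s
    x[1::2] = (c[:half] - c[half:]) / s
    return x

def dct_matrix(n):
    """Orthonormal DCT-II matrix; its transpose is the inverse DCT."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] /= np.sqrt(2.0)
    return m

def embed(audio, watermark, alpha=0.02):
    """Follow the Fig. 1 path on one block; comments give the unit numbers."""
    spectrum = np.fft.rfft(audio)                        # Fourier transformer 10
    magnitude = np.abs(spectrum)                         # magnitude extractor 11
    phase = np.angle(spectrum)                           # phase extractor 12
    coeffs = haar_forward(magnitude)                     # wavelet transformer 13
    if len(watermark) != len(coeffs):
        raise ValueError("watermark length must equal the number of bins")
    wm_coeffs = dct_matrix(len(watermark)) @ watermark   # DCT 14
    signed = np.sign(coeffs) * wm_coeffs                 # signum 15, multiplier 31
    combined = coeffs + alpha * signed                   # multiplier 32, adder 30
    new_magnitude = haar_inverse(combined)               # inverse wavelet 16
    marked_spectrum = new_magnitude * np.exp(1j * phase)
    return np.fft.irfft(marked_spectrum, n=len(audio))   # inverse Fourier 17

# Hypothetical test block: 254 samples give 128 one-sided frequency bins.
rng = np.random.default_rng(0)
audio = rng.uniform(-0.01, 0.01, 254)
audio[0] += 10.0                     # click keeps all magnitudes well above zero
watermark = rng.integers(0, 2, size=128).astype(float)

marked = embed(audio, watermark, alpha=0.02)
print(np.max(np.abs(marked - audio)))  # largest per-sample change
```

With alpha inside the stated range of 0.01 to 0.05, the per-sample change stays
small relative to the signal, which is the intent of the scaling step.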

Next, extraction of a watermark from watermark-embedded audio data will
be described referring to Fig. 2. Watermark-embedded audio data undergoes a
Fourier transform at a Fourier transformer 20 to generate a first set of
coefficients in the frequency domain. Simultaneously or independently, original
audio data is also Fourier transformed at a Fourier transformer 23 to generate
a second set of coefficients in the frequency domain. The magnitudes of the two
sets of coefficients, obtained by magnitude extractors 21 and 24 respectively,
are further wavelet transformed at wavelet transformers 22 and 25,
respectively. The wavelet coefficients associated with the original audio data
are subtracted from those associated with the watermark-embedded audio signal
at a subtracter 33. The differences in the coefficients are multiplied by a
scaling factor (1/α) and by the sign (1 for positive, 0 for none and -1 for
negative) of the wavelet transform coefficients associated with the original
audio data at a multiplier 34. The sign can be obtained by using a signum
function unit 26. Finally, the scaled coefficients, multiplied by the output of
the signum function unit 26, are inverse discrete cosine transformed at an
inverse discrete cosine transformer 27 to produce the watermark which had been
embedded in the original audio data.
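
The extraction path of Fig. 2 can be illustrated as a round trip. The sketch
below, in Python with NumPy, rests on the same assumptions as any
simplification of the scheme: a single-level Haar wavelet (not named in the
disclosure), a one-dimensional watermark, and a one-sided real FFT. All
function names and the test signal are hypothetical. Recovery is exact here
only because the adjusted magnitudes stay positive and no wavelet coefficient
of the original audio is exactly zero.

```python
import numpy as np

def haar_forward(x):
    """Single-level orthonormal Haar transform (assumed wavelet)."""
    x = np.asarray(x, dtype=float)
    s = np.sqrt(2.0)
    return np.concatenate([(x[0::2] + x[1::2]) / s, (x[0::2] - x[1::2]) / s])

def haar_inverse(c):
    """Inverse of haar_forward."""
    half = len(c) // 2
    s = np.sqrt(2.0)
    x = np.empty(2 * half)
    x[0::2] = (c[:half] + c[half:]) / s
    x[1::2] = (c[:half] - c[half:]) / s
    return x

def dct_matrix(n):
    """Orthonormal DCT-II matrix; its transpose is the inverse DCT."""
    k = np.arange(n)[:, None]
    i = np.arange(n)[None, :]
    m = np.sqrt(2.0 / n) * np.cos(np.pi * (2 * i + 1) * k / (2 * n))
    m[0, :] /= np.sqrt(2.0)
    return m

def embed(audio, watermark, alpha):
    """Embedding path of Fig. 1, needed here to produce test input."""
    spectrum = np.fft.rfft(audio)
    coeffs = haar_forward(np.abs(spectrum))
    wm_coeffs = dct_matrix(len(watermark)) @ watermark
    combined = coeffs + alpha * np.sign(coeffs) * wm_coeffs
    marked = haar_inverse(combined) * np.exp(1j * np.angle(spectrum))
    return np.fft.irfft(marked, n=len(audio))

def extract(marked, original, alpha):
    """Follow the Fig. 2 path; comments give the unit numbers."""
    c_marked = haar_forward(np.abs(np.fft.rfft(marked)))    # units 20, 21, 22
    c_orig = haar_forward(np.abs(np.fft.rfft(original)))    # units 23, 24, 25
    diff = c_marked - c_orig                                # subtracter 33
    wm_coeffs = (1.0 / alpha) * np.sign(c_orig) * diff      # signum 26, multiplier 34
    return dct_matrix(len(wm_coeffs)).T @ wm_coeffs         # inverse DCT 27

# Hypothetical test block: 254 samples give 128 one-sided frequency bins.
rng = np.random.default_rng(0)
audio = rng.uniform(-0.01, 0.01, 254)
audio[0] += 10.0                     # click keeps all magnitudes well above zero
watermark = rng.integers(0, 2, size=128).astype(float)

alpha = 0.02
marked = embed(audio, watermark, alpha)
recovered = extract(marked, audio, alpha)
print(np.allclose(recovered, watermark, atol=1e-6))
```

For alpha between 0.01 and 0.05, the factor 1/alpha lies between 20 and 100,
which matches the range given for the extraction scaling factor in the summary.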

The watermarking method described above can be implemented on a single-chip
integrated circuit or with discrete components. Specifically, a digital signal
processor may be programmed to perform the steps of the inventive
watermarking.

While there has been described and illustrated a method and system for
inserting watermark data by discrete cosine transforming the watermark signal
and Fourier/wavelet transforming original audio data, it will be apparent to
those skilled in the art that variations and modifications are possible without
deviating from the broad principles and teachings of the present invention,
which shall be limited solely by the scope of the claims appended hereto.